An Ensemble Method to Distinguish Bacteriophage Virion from Non-Virion Proteins Based on Protein Sequence Characteristics
نویسندگان
چکیده
Bacteriophage virion proteins and non-virion proteins have distinct functions in biological processes, such as specificity determination for host bacteria, bacteriophage replication and transcription. Accurate identification of bacteriophage virion proteins from bacteriophage protein sequences is significant to understand the complex virulence mechanism in host bacteria and the influence of bacteriophages on the development of antibacterial drugs. In this study, an ensemble method for bacteriophage virion protein prediction from bacteriophage protein sequences is put forward with hybrid feature spaces incorporating CTD (composition, transition and distribution), bi-profile Bayes, PseAAC (pseudo-amino acid composition) and PSSM (position-specific scoring matrix). When performing on the training dataset 10-fold cross-validation, the presented method achieves a satisfactory prediction result with a sensitivity of 0.870, a specificity of 0.830, an accuracy of 0.850 and Matthew's correlation coefficient (MCC) of 0.701, respectively. To evaluate the prediction performance objectively, an independent testing dataset is used to evaluate the proposed method. Encouragingly, our proposed method performs better than previous studies with a sensitivity of 0.853, a specificity of 0.815, an accuracy of 0.831 and MCC of 0.662 on the independent testing dataset. These results suggest that the proposed method can be a potential candidate for bacteriophage virion protein prediction, which may provide a useful tool to find novel antibacterial drugs and to understand the relationship between bacteriophage and host bacteria. For the convenience of the vast majority of experimental Int. J. Mol. Sci. 2015, 16,21735 scientists, a user-friendly and publicly-accessible web-server for the proposed ensemble method is established.
منابع مشابه
Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis.
The bacteriophage virion proteins play extremely important roles in the fate of host bacterial cells. Accurate identification of bacteriophage virion proteins is very important for understanding their functions and clarifying the lysis mechanism of bacterial cells. In this study, a new sequence-based method was developed to identify phage virion proteins. In the new method, the protein sequence...
متن کاملThe Effect of Herpes Simplex Virus Virion Host Shutoff Gene- a New Suicide Gene- on Tumor Cells
Background: The herpes simplex virus (HSV) UL41 gene product, virion host shutoff (Vhs) protein, mediates the rapid degradation of both viral and cellular mRNA. This ability suggests that Vhs protein can be used as a suicide gene in cancer gene therapy applications. The recent reports have shown that the degradation of cellular mRNA during herpes simplex infection is selective. RNA containing A...
متن کاملThree-dimensional reconstructions of the bacteriophage CUS-3 virion reveal a conserved coat protein I-domain but a distinct tailspike receptor-binding domain.
CUS-3 is a short-tailed, dsDNA bacteriophage that infects serotype K1 Escherichia coli. We report icosahedrally averaged and asymmetric, three-dimensional, cryo-electron microscopic reconstructions of the CUS-3 virion. Its coat protein structure adopts the "HK97-fold" shared by other tailed phages and is quite similar to that in phages P22 and Sf6 despite only weak amino acid sequence similarit...
متن کاملGenome sequence, structural proteins, and capsid organization of the cyanophage Syn5: a "horned" bacteriophage of marine synechococcus.
Marine Synechococcus spp and marine Prochlorococcus spp are numerically dominant photoautotrophs in the open oceans and contributors to the global carbon cycle. Syn5 is a short-tailed cyanophage isolated from the Sargasso Sea on Synechococcus strain WH8109. Syn5 has been grown in WH8109 to high titer in the laboratory and purified and concentrated retaining infectivity. Genome sequencing and an...
متن کاملA multivalent adsorption apparatus explains the broad host range of phage phi92: a comprehensive genomic and structural analysis.
Bacteriophage phi92 is a large, lytic myovirus isolated in 1983 from pathogenic Escherichia coli strains that carry a polysialic acid capsule. Here we report the genome organization of phi92, the cryoelectron microscopy reconstruction of its virion, and the reinvestigation of its host specificity. The genome consists of a linear, double-stranded 148,612-bp DNA sequence containing 248 potential ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 16 شماره
صفحات -
تاریخ انتشار 2015